Combination of Classifier Cascades and Training Sample Selection for Robust Face Detection

نویسندگان

  • A. Waibel
  • Lorant Szasz-Toth
چکیده

Face detection is one of the most fundamental tasks in human-computer-interaction, surveillance, and, more recently, image retrieval. Determining the location and size of faces in input images is a prerequisite for many other applications, including face recognition. In recent years several breakthroughs have been made in this field. These days, face detectors deliver high detection rates, low false alarm rates and run in real-time. Despite the efforts and publicly available tools, training high-performance face detectors from scratch remains a challenge. Mostly, because training time for a single cascade can be in the order of days and various training parameters have to be chosen carefully. Usually, training involves acquiring heuristics and a feeling for the intricacies of the training process and the influence of training parameters. A substantial amount of time is spent training classifiers iteratively and modifying parameters, while usually discarding intermediate results. The goal of this work is to overcome some of the problems of training cascade classifiers and to promote the use of custom-trained classifiers. Specifically, two problems are addressed in this work. First, an approach to combine several trained cascade classifiers into a single cascade is presented and evaluated. Second, a technique to optimize the training set is explored. A major challenge during cascade training is the choice of training parameters. There is no ideal way to choose these parameters and optimization is not feasible. Usually, the process involves several attempts or guesses at the right parameters and, finally, the best performing classifier is selected. Instead of discarding intermediate results, several of these classifiers are combined into a single new classifier. Unlike previous work, the base classifiers are not run in parallel but a fixed number of individual classifier stages are optimized, selected and combined into a new classifier without added run-time overhead. Experiments have shown the importance of a proper choice of training samples. Classifiers trained with a reduced amount of well-chosen samples can outperform a classifier that was trained on a far larger training set. The use of less training samples to achieve the same performance decreases the required training time, especially with large training sets, where results cannot be cached. Additionally, forcing the classifier to focus on difficult training examples has shown to increase classification performance. Therefore, a method to select an optimized set of training samples from a large set with the help of support vector machines is explored. The results of both presented approaches have been evaluated on the widely used, publicly available CMU+MIT database. Both the SVM-based training sample selection and the cascade combination approaches are shown to improve the performance over the base classifiers. Cascade combination allows to generate a classifier within a single day that performs nearly as well as a single, high-performance classifier trained in more than ten days. Additionally, classifiers generated by cascade combination outperform the orignal base cascades. Face detectors trained with SVM-based training set selection perform better than equally trained base classifiers with a random choice of training samples. Both presented approaches were able to produce cascade classifiers that clearly outperform the publicly available OpenCV face detectors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-View Face Detection in Open Environments using Gabor Features and Neural Networks

Multi-view face detection in open environments is a challenging task, due to the wide variations in illumination, face appearances and occlusion. In this paper, a robust method for multi-view face detection in open environments, using a combination of Gabor features and neural networks, is presented. Firstly, the effect of changing the Gabor filter parameters (orientation, frequency, standard d...

متن کامل

Fault Detection and Classification in Double-Circuit Transmission Line in Presence of TCSC Using Hybrid Intelligent Method

In this paper, an effective method for fault detection and classification in a double-circuit transmission line compensated with TCSC is proposed. The mutual coupling of parallel transmission lines and presence of TCSC affect the frequency content of the input signal of a distance relay and hence fault detection and fault classification face some challenges. One of the most effective methods fo...

متن کامل

Hierarchical PSO-Adaboost Based Classifiers for Fast and Robust Face Detection

We propose a fast and robust hierarchical face detection system which finds and localizes face images with a cascade of classifiers. Three modules contribute to the efficiency of our detector. First, heterogeneous feature descriptors are exploited to enrich feature types and feature numbers for face representation. Second, a PSO-Adaboost algorithm is proposed to efficiently select discriminativ...

متن کامل

Improved Face Detection Using Spatial Histogram Features

In this paper, we improve an object detection approach using spatial histogram features, by applying classifier ensemble. The spatial histogram features can preserve texture and shape information of an object, simultaneously. We train a hierarchical classifier by combining cascade histogram matching and the combination of Multi Layer Perceptrons. The cascade histogram matching is trained via au...

متن کامل

Sample-oriented Domain Adaptation for Image Classification

Image processing is a method to perform some operations on an image, in order to get an enhanced image or to extract some useful information from it. The conventional image processing algorithms cannot perform well in scenarios where the training images (source domain) that are used to learn the model have a different distribution with test images (target domain). Also, many real world applicat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009